Semantic Annotation, Indexing, and Retrieval
Identifieur interne : 000239 ( Main/Exploration ); précédent : 000238; suivant : 000240Semantic Annotation, Indexing, and Retrieval
Auteurs : Atanas Kiryakov [Bulgarie] ; Borislav Popov [Bulgarie] ; Damyan Ognyanoff [Bulgarie] ; Dimitar Manov [Bulgarie] ; Angel Kirilov [Bulgarie] ; Miroslav Goranov [Bulgarie]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2003.
Abstract
Abstract: The Semantic Web realization depends on the availability of critical mass of metadata for the web content, linked to formal knowledge about the world. This paper presents our vision about a holistic system allowing annotation, indexing, and retrieval of documents with respect to real-world entities. A system (called KIM), partially implementing this concept is shortly presented and used for evaluation and demonstration. Our understanding is that a system for semantic annotation should be based upon specific knowledge about the world, rather than indifferent to any ontological commitments and general knowledge. To assure efficiency and reusability of the metadata we introduce a simplistic upper-level ontology which starts with some basic philosophic distinctions and goes down to the most popular entity types (people, companies, cities, etc.), thus providing many of the inter-domain common sense concepts and allowing easy domain-specific extensions. Based on the ontology, an extensive knowledge base of entities descriptions is maintained. Semantically enhanced information extraction system providing automatic annotation with references to classes in the ontology and instances in the knowledge base is presented. Based on these annotations, we perform IR-like indexing and retrieval, further extended using the ontology and knowledge about the specific entities.
Url:
DOI: 10.1007/978-3-540-39718-2_31
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000438
- to stream Istex, to step Curation: 000438
- to stream Istex, to step Checkpoint: 000196
- to stream Main, to step Merge: 000260
- to stream Main, to step Curation: 000239
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Semantic Annotation, Indexing, and Retrieval</title>
<author><name sortKey="Kiryakov, Atanas" sort="Kiryakov, Atanas" uniqKey="Kiryakov A" first="Atanas" last="Kiryakov">Atanas Kiryakov</name>
</author>
<author><name sortKey="Popov, Borislav" sort="Popov, Borislav" uniqKey="Popov B" first="Borislav" last="Popov">Borislav Popov</name>
</author>
<author><name sortKey="Ognyanoff, Damyan" sort="Ognyanoff, Damyan" uniqKey="Ognyanoff D" first="Damyan" last="Ognyanoff">Damyan Ognyanoff</name>
</author>
<author><name sortKey="Manov, Dimitar" sort="Manov, Dimitar" uniqKey="Manov D" first="Dimitar" last="Manov">Dimitar Manov</name>
</author>
<author><name sortKey="Kirilov, Angel" sort="Kirilov, Angel" uniqKey="Kirilov A" first="Angel" last="Kirilov">Angel Kirilov</name>
</author>
<author><name sortKey="Goranov, Miroslav" sort="Goranov, Miroslav" uniqKey="Goranov M" first="Miroslav" last="Goranov">Miroslav Goranov</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:00A9BD5FDC711FD7F241B0BC4EB6E2CA3DE733E9</idno>
<date when="2003" year="2003">2003</date>
<idno type="doi">10.1007/978-3-540-39718-2_31</idno>
<idno type="url">https://api.istex.fr/document/00A9BD5FDC711FD7F241B0BC4EB6E2CA3DE733E9/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000438</idno>
<idno type="wicri:Area/Istex/Curation">000438</idno>
<idno type="wicri:Area/Istex/Checkpoint">000196</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000196</idno>
<idno type="wicri:doubleKey">0302-9743:2003:Kiryakov A:semantic:annotation:indexing</idno>
<idno type="wicri:Area/Main/Merge">000260</idno>
<idno type="wicri:Area/Main/Curation">000239</idno>
<idno type="wicri:Area/Main/Exploration">000239</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Semantic Annotation, Indexing, and Retrieval</title>
<author><name sortKey="Kiryakov, Atanas" sort="Kiryakov, Atanas" uniqKey="Kiryakov A" first="Atanas" last="Kiryakov">Atanas Kiryakov</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
<author><name sortKey="Popov, Borislav" sort="Popov, Borislav" uniqKey="Popov B" first="Borislav" last="Popov">Borislav Popov</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
<author><name sortKey="Ognyanoff, Damyan" sort="Ognyanoff, Damyan" uniqKey="Ognyanoff D" first="Damyan" last="Ognyanoff">Damyan Ognyanoff</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
<author><name sortKey="Manov, Dimitar" sort="Manov, Dimitar" uniqKey="Manov D" first="Dimitar" last="Manov">Dimitar Manov</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
<author><name sortKey="Kirilov, Angel" sort="Kirilov, Angel" uniqKey="Kirilov A" first="Angel" last="Kirilov">Angel Kirilov</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
<author><name sortKey="Goranov, Miroslav" sort="Goranov, Miroslav" uniqKey="Goranov M" first="Miroslav" last="Goranov">Miroslav Goranov</name>
<affiliation wicri:level="3"><country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Ontotext Lab, Sirma AI EOOD, 138 Tsarigradsko Shose, 1784, Sofia</wicri:regionArea>
<placeName><settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2003</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">00A9BD5FDC711FD7F241B0BC4EB6E2CA3DE733E9</idno>
<idno type="DOI">10.1007/978-3-540-39718-2_31</idno>
<idno type="ChapterID">31</idno>
<idno type="ChapterID">Chap31</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: The Semantic Web realization depends on the availability of critical mass of metadata for the web content, linked to formal knowledge about the world. This paper presents our vision about a holistic system allowing annotation, indexing, and retrieval of documents with respect to real-world entities. A system (called KIM), partially implementing this concept is shortly presented and used for evaluation and demonstration. Our understanding is that a system for semantic annotation should be based upon specific knowledge about the world, rather than indifferent to any ontological commitments and general knowledge. To assure efficiency and reusability of the metadata we introduce a simplistic upper-level ontology which starts with some basic philosophic distinctions and goes down to the most popular entity types (people, companies, cities, etc.), thus providing many of the inter-domain common sense concepts and allowing easy domain-specific extensions. Based on the ontology, an extensive knowledge base of entities descriptions is maintained. Semantically enhanced information extraction system providing automatic annotation with references to classes in the ontology and instances in the knowledge base is presented. Based on these annotations, we perform IR-like indexing and retrieval, further extended using the ontology and knowledge about the specific entities.</div>
</front>
</TEI>
<affiliations><list><country><li>Bulgarie</li>
</country>
<region><li>Sofia-ville (oblast)</li>
</region>
<settlement><li>Sofia</li>
</settlement>
</list>
<tree><country name="Bulgarie"><region name="Sofia-ville (oblast)"><name sortKey="Kiryakov, Atanas" sort="Kiryakov, Atanas" uniqKey="Kiryakov A" first="Atanas" last="Kiryakov">Atanas Kiryakov</name>
</region>
<name sortKey="Goranov, Miroslav" sort="Goranov, Miroslav" uniqKey="Goranov M" first="Miroslav" last="Goranov">Miroslav Goranov</name>
<name sortKey="Goranov, Miroslav" sort="Goranov, Miroslav" uniqKey="Goranov M" first="Miroslav" last="Goranov">Miroslav Goranov</name>
<name sortKey="Kirilov, Angel" sort="Kirilov, Angel" uniqKey="Kirilov A" first="Angel" last="Kirilov">Angel Kirilov</name>
<name sortKey="Kirilov, Angel" sort="Kirilov, Angel" uniqKey="Kirilov A" first="Angel" last="Kirilov">Angel Kirilov</name>
<name sortKey="Kiryakov, Atanas" sort="Kiryakov, Atanas" uniqKey="Kiryakov A" first="Atanas" last="Kiryakov">Atanas Kiryakov</name>
<name sortKey="Manov, Dimitar" sort="Manov, Dimitar" uniqKey="Manov D" first="Dimitar" last="Manov">Dimitar Manov</name>
<name sortKey="Manov, Dimitar" sort="Manov, Dimitar" uniqKey="Manov D" first="Dimitar" last="Manov">Dimitar Manov</name>
<name sortKey="Ognyanoff, Damyan" sort="Ognyanoff, Damyan" uniqKey="Ognyanoff D" first="Damyan" last="Ognyanoff">Damyan Ognyanoff</name>
<name sortKey="Ognyanoff, Damyan" sort="Ognyanoff, Damyan" uniqKey="Ognyanoff D" first="Damyan" last="Ognyanoff">Damyan Ognyanoff</name>
<name sortKey="Popov, Borislav" sort="Popov, Borislav" uniqKey="Popov B" first="Borislav" last="Popov">Borislav Popov</name>
<name sortKey="Popov, Borislav" sort="Popov, Borislav" uniqKey="Popov B" first="Borislav" last="Popov">Borislav Popov</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000239 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000239 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Ticri |area= TeiVM2 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:00A9BD5FDC711FD7F241B0BC4EB6E2CA3DE733E9 |texte= Semantic Annotation, Indexing, and Retrieval }}
This area was generated with Dilib version V0.6.31. |